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Multiple-input  multiple-output  (MIMO)  underwater  acoustic  (UWA)  channels  introduce  both  space-time  interference  (STI)  and 
time-varying  phase  distortion  for  transmitted  signals.  In  such  cases,  the  equalized  symbols  produced  by  conventional  equalizer 
aiming  for  STI  cancelation  suffer  phase  rotation  and  thus  cannot  be  reliably  detected.  In  this  paper,  we  propose  a  new  equalization 
scheme  for  high  data  rate  single  carrier  MIMO  UWA  channels.  Different  from  existing  methods  employing  joint  equalization 
and  symbolwise  phase  tracking  technology,  the  proposed  scheme  decouples  the  interference  cancelation  (IC)  operation  and  the 
phase  compensation  operation,  leading  to  a  generalized  equalizer  structure  combining  an  IC  equalizer  with  a  phase  compensator. 
The  decoupling  of  the  two  functionalities  leads  to  robust  signal  detection,  which  is  most  desirable  in  practical  UWA  applications. 
MIMO  linear  equalizer  (LE)  is  adopted  to  remove  space-time  interference,  and  a  groupwise  phase  estimation  and  correction 
method  is  used  to  compensate  the  phase  rotation.  In  addition,  the  layered  space-time  processing  technology  is  adopted  to  enhance 
the  equalization  performance.  The  proposed  equalization  scheme  is  tested  to  be  very  robust  with  extensive  experimental  data 
collected  at  Kauai,  Hawaii,  in  September  2005,  and  Saint  Margaret’s  Bay,  Nova  Scotia,  Canada,  in  May  2006. 


1.  Introduction 

Underwater  acoustic  (UWA)  channel  is  recognized  as  one 
of  the  most  challenging  channels  in  practical  use  [1]. 
The  obstacles  imposed  by  the  water  media  on  acoustic 
propagation  are  reflected  in  four  aspects.  First,  the  available 
channel  bandwidth  is  very  limited  due  to  the  frequency- 
dependent  attenuation.  For  example,  in  medium-range 
UWA  communications,  the  bandwidth  is  on  the  order  of 
a  few  tens  of  kilohertz.  Second,  the  channel  delay  spread 
is  very  long  due  to  the  rich  scattering  environment.  For 
instance,  it  could  be  over  several  tens  of  milliseconds  (ms), 
leading  to  an  equivalent  discrete-time  channel  with  several 
tens  or  even  hundreds  of  channel  taps  in  contrast  to  less 
than  twenty  taps  in  radio  frequency  (RF)  communications. 
Third,  the  Doppler  effect  is  very  significant  due  to  the 
low  propagation  speed  of  sound  (about  1500  m/s  in  water). 


On  one  hand,  the  motion-induced  Doppler  shift  causes  a 
normalized  carrier  frequency  offset  (CFO)  on  the  order  of 
10-4  to  1CT3,  compared  to  10-8  to  10~6  in  RF  channels.  On 
the  other  hand,  the  motion-induced  waveform  compression 
or  dilation  incurs  nonnegligible  symbol  offset  requiring 
signal  resampling.  Finally,  the  temporal  variation  of  the 
UWA  channel  is  very  fast  due  to  the  dynamics  of  the  water 
mass,  which  imposes  difficulty  on  both  channel  estimation 
and  phase  tracking. 

In  the  past  three  decades,  significant  progress  has 
been  achieved  in  UWA  communications  [2-14],  Earlier 
UWA  communications  adopted  noncoherent  frequency- shift 
keying  (FSK)  technology  which  enabled  simple  energy- 
based  signal  detection  combating  the  unpleasant  effect  of 
channel  reverberation  [2],  The  drawback  of  FSK  lies  in  its 
low  transmission  rate  and  also  low  bandwidth  efficiency. 
The  partial  coherent  modulation  of  differential  phase  shift 
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keying  (DPSK)  was  then  chosen  to  achieve  a  bandwidth 
efficiency  between  noncoherent  and  fully  coherent  sys¬ 
tems.  It  was  until  early  1990s,  UWA  transmission  using 
the  bandwidth-efficient  coherent  modulation  appeared  in 
[3].  Different  from  FSK,  coherent  transmission  adopting 
modulations  like  phase  shift  keying  (PSK)  and  quadrature 
amplitude  modulation  (QAM)  requires  proper  cancelation 
of  intersymbol  interference  (ISI)  and  compensation  of 
phase  distortion  in  signal  detection;  both  tasks  become 
very  difficult  under  hash  UWA  channel  conditions.  Passive- 
phase  conjugation  (PPC)  [4]  and  time  reversal  (TR)  [5] 
technologies,  both  having  different  principles  from  equal¬ 
ization,  have  been  proposed  to  mitigate  ISI.  The  detection 
using  PPC  technology,  however,  has  poor  performance 
when  only  a  small  number  of  receiving  hydrophones  are 
available  [6].  As  a  result,  equalization  technology  is  more 
commonly  adopted  for  coherent  detection.  Generally,  equal¬ 
ization  can  be  performed  in  either  time-domain  (TD)  [7- 
11]  or  frequency-domain  (FD)  [12-14].  In  [7,  8],  the 
classic  joint  design  of  decision  feedback  equalizer  (DFE) 
and  phase-locked  loop  (PLL)  has  been  proposed,  and  its 
iterative  implementation  can  be  found  in  [9].  In  [10],  by 
coupling  PPC  technology  with  a  single-channel  DFE,  the 
correlation-based  DFE  is  proposed  and  tested  to  be  robust 
to  different  acoustic  environments.  The  phase  rotation  is 
tracked  with  PLL  technology.  In  [11],  linear  equalization 
combined  with  proper  phase  compensation  has  been  pro¬ 
posed  to  minimize  error  propagation  due  to  incorrect 
decision  feedback.  In  [12],  single-carrier  frequency-domain 
equalization  (SC-FDE)  followed  by  phase  compensation 
is  proposed  for  single-carrier  systems,  and  the  frequency- 
domain  equalization  for  multiple-carrier  orthogonal  fre¬ 
quency  division  multiplexing  (OFDM)  systems  has  been 
proposed  in  [13,  14].  In  [13],  equalization  is  performed  with 
adaptive  channel  estimation  and  phase  tracking  method. 
In  [14],  pilot-aided  channel  estimation  is  adopted  for 
equalization,  and  a  two-step  Doppler  compensation  is 
adopted  to  remove  phase  rotation.  While  FD  equalization 
enables  low- complexity  implementation  even  over  highly 
dispersive  channel,  it  usually  requires  extra  guard  intervals 
(GIs)  among  transmission  blocks,  which  sacrifices  the  data 
transmission  efficiency.  Moreover,  the  inherent  sensitivity 
of  OFDM  systems  to  carrier  frequency  offset  makes  robust 
signal  detection  very  challenging  especially  with  moving 
transceivers. 

Despite  the  diverse  equalization  schemes,  the  demon¬ 
strated  data  rate  of  UWA  communication  is  relatively  low  due 
to  the  natural  limitation  on  the  available  channel  bandwidth. 
In  recent  years,  researchers  have  started  to  explore  the 
spatial  structure  of  the  oceans  to  fundamentally  improve 
the  transmission  rate.  In  the  past  years,  MIMO  UWA 
communications  have  been  investigated  in  [15-19].  In  [15], 
the  number  of  the  available  degrees  of  freedom  in  the  UWA 
channel  is  studied.  In  [16],  coherent  MIMO  transmission 
has  been  presented,  using  a  time  reversal  approach.  In  [17], 
the  joint  DFE  and  PLL  scheme  originally  proposed  in  [7] 
for  single  input  multiple  output  (SIMO)  systems  has  been 
extended  to  MIMO  cases.  In  [18],  turbo  linear  equalization 
has  been  used  for  MIMO  UWA  communication.  The  FD 
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equalization  schemes  proposed  in  [12,  14]  have  also  been 
extended  to  MIMO  systems  in  [19,  20],  respectively. 

In  this  paper,  we  propose  a  new  time-domain  MIMO 
equalization  scheme  for  single  carrier  UWA  communica¬ 
tions.  Different  from  conventional  schemes  performing  joint 
equalization  and  phase  tracking  [3,  7-9,  17],  where  the 
requirement  for  careful  tuning  of  DFE  and  PLL  parameters 
makes  the  system  less  stable  [1],  the  new  scheme  decou¬ 
ples  the  interference-cancelation  functionality  and  phase- 
synchronization  functionality  leading  to  a  generalized  equal¬ 
izer  structure  consisting  of  an  interference-cancelation  (IC) 
equalizer  and  a  phase  compensator.  MIMO  linear  equalizer 
(LE)  is  adopted  to  achieve  low-complexity  equalization 
and  also  to  avoid  the  error  propagation  in  DFE  especially 
under  harsh  channel  conditions.  A  novel  groupwise  phase 
estimation  and  correction  method  proposed  in  [21],  which 
is  insensitive  to  noise  disturbance,  is  used  to  compensate 
phase  rotations  in  the  equalized  symbols.  The  proposed 
equalization  scheme  has  been  adopted  in  a  layered  receiver 
structure  for  UWA  communications  and  has  been  tested 
by  high-rate  MIMO  experimental  data  measured  off  the 
northwestern  coast  of  Kauai,  Hawaii,  in  September  2005,  and 
by  both  moving-source  and  fixed-source  SIMO  experimental 
data  measured  at  Saint  Margaret’s  Bay,  Nova  Scotia,  Canada, 
in  May  2006.  We  have  achieved  successful  equalization  in 
both  experiments. 

The  rest  of  the  paper  is  organized  as  follows.  In  Section  2, 
a  general  MIMO  UWA  system  model  is  given.  Based  on  the 
system  model,  MIMO  channel  estimation  is  introduced  in 
Section  3,  as  the  basis  for  equalizer  design.  The  new  MIMO 
equalization  scheme  is  then  developed  in  Section  4,  where 
the  MIMO  IC  equalization,  the  groupwise  phase  estimation 
and  compensation,  and  the  layered  space-time  processing 
technology  are  discussed,  respectively.  Section  5  reports  the 
results  on  experimental  data  processing,  and  conclusion  is 
drawn  in  Section  6. 

2.  MIMO  System  Model 

For  a  MIMO  UWA  communication  system  employing  N 
transmitting  transducers  and  M  receiving  hydrophones, 
the  baseband  discrete-time  signal  received  at  the  mth 
hydrophone  is  represented  as 

N  L-l 

ym(k)  =  X  JhnAk,  Wn(k  -  l)e^k)  +  vm(k),  (1) 

n=  1  1=0 

where  xn(k)  is  the  effective  transmission  symbol  of  the  nth 
transducer  observed  at  the  receiver,  hn,m(k,l)  and  (f>n,m{k) 
are  the  7th  complex  fading  coefficient  and  the  phase  drift 
of  the  time-varying  subchannel  between  the  nth  transducer 
and  the  mth  hydrophone,  and  L  is  the  channel  length.  The 
phase  drift  is  a  combining  effect  of  the  average  Doppler 
shift,  fnm,  the  instantaneous  Doppler,  /„>m(fc),  and  the  coarse 
synchronization  phase  error,  0„>m,  and  can  be  expressed  as 
(fn,m{k)  =  2n[fnm  +  fn,m(k)]kTs  +  0n,m,  where  Ts  is  the 
symbol  interval.  The  term  vm(k)  is  the  sample  of  a  zero- 
mean  additive  white  Gaussian  noise  (AWGN)  with  power  af, 
on  the  mth  hydrophone.  For  practical  UWA  channels,  the 


EURASIP  Journal  on  Advances  in  Signal  Processing 


3 


fading  coefficient  hn,m(k,l)  usually  changes  slower  than  the 
instantaneous  phase  <jf>„,m(fc);  so  it  is  appropriate  to  treat  them 
separately  [17]. 

The  average  Doppler  shift  fn  m  is  caused  by  the  relative 
motion  between  the  transmitter  and  the  receiver,  while 
the  instantaneous  Doppler  modeled  as  a  zero-mean 

time-varying  random  variable  comes  from  the  dynamics  of 
the  oceans.  When  the  transmitter-receiver  relative  motion 
is  nonnegligible,  it  causes  the  transmitted  signal  to  be 
compressed  or  dilated  at  the  receiver  side  [22,  23].  In 
this  case,  the  effective  transmission  symbol  xn(k)  in  (1)  is 
not  exactly  the  same  as  the  originally  transmitted  symbol 
xn{k),  where  x„(k)  e  S  with  S  =  {yq}q= i  being  the 
modulation  constellation  of  size  Q.  As  a  result,  compensation 
of  the  average  Doppler  shift  fn  m  and  resampling  of  the 
received  signal,  herein  named  as  Doppler  preprocessing, 
are  necessary  before  signal  detection  can  be  performed.  An 
efficient  average  Doppler  shift  estimation  method  has  been 
provided  in  [22].  After  Doppler  preprocessing,  the  signal 
model  in  (1)  becomes 

N  L- 1 

y,n(k)  =  X  Y,h^{k,l)xn{k  -  /)eiW*0  +  vm(fc),  (2) 
n= 1  1=0 

where  ym(k),  hnttn(k,l),  </)„,m(k),  and  vm(k)  are  the  received 
symbol,  the  fading  coefficient,  the  phase  drift,  and  the 
additive  noise  after  Doppler  preprocessing,  respectively.  The 
phase  term,  <f>„,m{k),  may  contain  residual  Doppler  shift  effect 
due  to  nonideal  Doppler  shift  estimation  and  compensation. 
The  noise  vm(k)  is  still  zero-mean  AWGN  with  variance 
power  dy . 

3.  Channel  Estimation  for 

MIMO  UWA  Communications 

As  the  basis  for  equalizer  design,  channel  estimation  for 
UWA  communications  is  developed  in  this  section.  In 
training  mode,  MIMO  channel  estimation  is  performed  with 
pilotsymbols  {p„(k),0  <  k  <  Np}n=l  fromallN transducers. 
We  assume  that  the  time  duration  of  the  pilot  sequences  is 
less  than  the  channel  coherence  time,  in  which  case  the  fading 
coefficient  hn,m{k,l )  in  (2)  can  be  treated  as  approximately 
time-invariant,  that  is,  hmm(k,l)  ~  h,hm{l).  The  assumption 
is  appropriate  in  many  UWA  communications.  Then  (2)  can 
be  represented  in  matrix  form  as 

N 

Ym  U 7 ; h , .  , , ;  ~t  vffl,  (3) 

n=l 


where ym  =  [ym(L-l),  ym(L),. . .  ,ym(Np-  1)]*  e  e(N?“L+1)xl 
is  the  received  sample  vector  at  the  mth  hydrophone  with  ( ■ )' 
denoting  matrix  transpose  operation, 


is  the  matrix  obtained  with  the  nth  pilot  sequence, 
and  Q>n,m  =  dmg{e^n-m('L~1K  h„>m 

~  [fi«,m(0)> ■  ■  ■  > hn,m(L  —  1)]  ,  and  vm  —  [vm{L  — 
1),  vm(L),. . . ,  vm(Np  -  l)]f  are  the  phase  drift  matrix,  the 
fading  coefficient  vector,  and  the  noise  vector,  respectively. 
The  operator  diag{  - }  returns  a  diagonal  matrix.  It  is  still 
difficult  to  estimate  h„,m  and  <P„>m  simultaneously  from  (3), 
due  to  the  separation  of  0„_m  and  h„>m  by  P„  on  the  right- 
hand  side.  To  proceed,  we  rewrite  <P„,m  as 

®  n,m  = 

x  diag|e^[l|!’n'm(I,_1)_^n'”(I)1 ,  _  1 1 , 

where  the  index  I  in  is  determined  as  I  =  \(Np  + 

L  -  2)/2l  with  M  denoting  the  smallest  integer  larger 

than  x.  The  instantaneous  phase  drifts  {<t>„,m(k)} in  (5) 
could  be  significant.  However,  the  differential  phase  drifts 

{<tln,m{k)  -  0n,m CO will  be  insignificant  over  a  short 
period  of  time.  Thus  (5)  can  be  approximated  by  ~ 
I+1  being  an  identity  matrix  of  order  j. 

Then  (3)  can  be  approximated  by 

N 

Ym  ~  XVn\eM)K,m]  +  Vm 

»=1  (6) 

Phm +  vm, 

where  P  =  [Pi,  P2, . . . ,  PN]  and  h,„  =  [eJ^^Wh^, 

. . .  )e;fem(t)h^  ]f.  From  (6),  the  minimum  mean  square 
error  (MMSE)  estimation  of  hw  is  given  by 

hm=  (■phP  +  ^lNLy1phYm,  (7) 

where  ( ■ ),!  denotes  matrix  Hermitian  transpose.  The  estima¬ 
tion  in  (7)  is  performed  on  all  M  hydrophones  to  obtain 
MIMO  channel  estimation.  It  is  noted  that  to  guarantee  the 
system  equation  (6)  not  to  be  underdetermined,  a  minimum 
of  Alp  >  (N+lJL-l  pilot  symbols  are  required  for  each  of  the 
N  transducers.  Last,  the  same  channel  estimation  procedure 
discussed  above  will  also  be  adopted  in  the  decision-directed 
mode,  where  the  previously  detected  symbols  instead  of  the 
pilot  symbols  are  used  for  channel  estimation. 

4.  New  Equalization  Scheme  for 
MIMO  UWA  Communication 

A  new  equalization  scheme  is  discussed  in  this  section,  where 
the  conventional  equalization  is  first  adopted  to  cancel  the 
space-time  interference  among  transmitted  symbols,  and 
then  a  novel  phase  estimation  and  compensation  method  is 
applied  to  remove  the  phase  rotations  in  the  equalized  sym¬ 
bols.  The  layered  receiver  structure  adopting  the  proposed 
equalization  scheme  is  then  demonstrated. 
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4.1.  MIMO  Equalization  for  Space-Time  Interference  Cancela¬ 
tion.  With  the  estimated  MIMO  channel,  MIMO  equaliza¬ 
tion  can  be  performed.  MIMO  linear  equalizer  is  adopted  in 
this  paper.  In  this  case,  the  kth  equalized  symbol  of  the  nth 
transmission  stream  (from  the  nth  transducer)  is  given  by 

M  K2 

xn{k)  =  X  X  c(n?mym{k  -  q) ,  (8) 

m=  1  q=-K\ 

where  Ki,K2  are  nonnegative  integers,  and  cfh  denotes  the 
qth  linear  equalizer  coefficient  corresponding  to  the  received 
sample  of  the  mth  hydrophone  for  equalizing  symbols  of  the 
nth  stream.  Without  loss  of  generality,  the  same  values  of 
Ki , K2  are  used  for  all  («,  m)  pairs,  resulting  inNxMx  (Ki  + 
K2  +  1 )  equalizer  taps  in  total. 

Collecting  the  equalized  symbols  at  time  k,  {xn(k)}^=l, 
of  all  N  streams  in  a  vector,  leads  to  the  matrix-form 
equalization  model  as 


x(fc)  =  Cy,  (9) 

where  x(k)  =  [xY  (k),x2{k), . . .  ,xN(k)]‘  G  CNxl,y  =  [y ‘(k- 
K2),Yt(k-K2  + 1), . . .  ,y*(ifc), . . .  ^(k+K^]*  e  eM^+K2+1>xl, 
andC=  [c»),c(if2“1),...,c(“ifl)]  G  qNxmIki+k2+i)  with y(A:  — 
q)  G  CMxl  and  fT  G  CNxM  defined,  respectively,  as 


transmission  symbol  correlation  matrix  R,  and  the  signal-to- 
noise  ratio  (SNR)  f  are  referred  to  (A.4),  (A.6b),  and  (A.7), 
respectively.  Since  R  is  constant,  the  MIMO  LE  matrix  only 
depends  on  the  knowledge  of  MIMO  channel  and  SNR. 

Remark.  In  (13),  two  alternative  solutions  for  the  MIMO  LE 
matrix  C  are  provided.  In  the  first  solution  (first  equality), 
matrix  inversion  of  order  M(Ki  +  K2  +  1)  is  required.  In  the 
second  solution,  matrix  inversion  of  order  N (fCi  +  K2  +  L) 
is  involved.  Since  matrix  inversion  is  the  main  source  of 
the  computational  complexity,  the  solution  with  smaller- 
order  matrix  inversion  is  always  favored  in  practical  MIMO 
systems. 


With  the  designed  LE  matrix  in  (13),  equalization 
can  then  be  performed.  Substituting  (2)  into  (9),  the  kth 
equalized  symbol  of  the  nth  stream  (or  the  nth  element  in 
x(k))  is  expressed  as 

Xn{k) 


M 


=  s 


k2  n  l-  1 

X  ZZ  c{Sihi,m  ( k  -  q,  l)Xi  ( k 

q=-Ki  i=  1  1=0 


—  q  —  l)ejMk-<i> 


3”  r]n(k)y 


(14) 


y (k-q)  =  [yi(k- q),y2{k- q),...,yM(k- q)]1, 

(10a) 


rM 

(q) 

_(<?)  “1 

ci,i 

cl,2  ’ 

'  C1,M 

M) 

(?) 

M) 

c2,l 

c2,2  ' 

'  C2,M 

M 

(q) 

M) 

\-cN,l 

CN,  2  ’ 

’  CNM- 

To  solve  the  equalizer  matrix  C  in  (9),  the  mean  squared 
error  (MSE)  is  applied  leading  to  the  cost  function  defined 
as  follows: 

7(C)  =  E [ | |x(fc)  -  x(k)||2],  (11) 

where  ||a||  is  the  Euclidean  norm  of  vector  a,  and  E[  -  ] 
denotes  mathematical  expectation.  By  minimizing  /(C)  in 
(11),  the  MMSE  MIMO  LE  matrix  is  solved  as 


where  qn(k)  =  Xm=  1  Sfi-x,  c(n,Lvm(k  -  q)  is  the  collection 
of  additive  noise  {vm(k)}m=1  in  the  equalized  symbol.  As  we 
can  see  from  ( 14),  the  triple  summation  in  the  square  bracket 
is  the  mth  hydrophone’s  contribution  to  the  equalized 
symbol  of  the  nth  stream.  Therefore,  we  have  the  following 
definition: 

0Cn,m(k')Xn(k')  +  £ n,m(k ) 

=  X  X  “  q>l)xi(k  -  q  ~  l)ei(l,‘-m(k^q), 

q=-Ki  2=1  1=0 

(15) 

where  a,hm(k.)  denotes  the  scaling  factor  which  is  usually  a 
complex  value  closely  related  to  the  equalizer  taps  cfih  and 
instantaneous  phase  rotation  g70i.m(fc— q) ,  an(j  ^n_m(fc)  denotes 
residual  interference  from  symbols  other  than  xn{k).  With 
above  definition,  (14)  is  simplified  as 


Cmmse  =  E[x{k)yh][\E[yyh  ]}"  <’2> 

which  is  further  simplified  as 

Cmmse  =  R,'H,'  ^HH?1  +  -1M(Ki+k2+i)^ 

(13) 

=  R,^H,'H+ilW(ifI+x2+L)j  H?‘, 

where  the  derivation  details  are  referred  to  the  Appendix.  In 
(13),  the  definitions  of  the  channel  matrix  H,  the  normalized 


M 

Xn(k)  =  X  otn,m{k')Xn(.k)  +  fi(k)  +  qn(k) 

m  =  1  (16) 

=  \yn{k)\ejZy,,(k)xn{k)  +  („{k), 

where  y„(k)  =  £m=i  ocn,m(k)  is  the  diversity  combining 
gain  of  M  hydrophones,  ffk)  =  Sm=i?«,m(^)  is  the 

overall  residual  interference,  and  (n(k)  =  £„(k)  +  qn(k)  is 
the  effective  noise  consists  of  the  residual  interference  and 
additive  noise  in  the  equalized  symbol.  Obviously,  regardless 
of  the  effective  noise,  the  equalized  symbol  xn(k)  in  ( 16)  is  an 
amplitude-scaled  and  phase-rotated  version  of  the  original 


EURASIP  Journal  on  Advances  in  Signal  Processing 


5 


symbol  xn{k).  The  phase  rotation  Zyn(k)  is  a  complicated 
term  caused  by  phase  drift  (f>„im(k).  For  systems  employing 
coherent  modulation  schemes  like  phase  shift  keying  (PSK), 
the  phase  rotation  is  hostile  and  must  be  compensated,  which 
is  the  topic  of  the  next  subsection. 


Step  3.  Calculate  the  average  phase  deviation  and  estimate 
the  rotating  phase,  respectively,  for  the  gth  group  as 

1  Ns 

^fn(g)  =  (21) 

iVs/c=l 


4.2.  Groupwise  Phase  Estimation  and  Compensation.  The 
groupwise  phase  estimation  and  correction  method  pro¬ 
posed  in  [21]  is  adopted  to  handle  the  phase  rotations 
in  the  equalized  symbol  xn{k),  as  shown  in  (16).  The 
motivation  for  the  groupwise  phase  estimation  and  correc¬ 
tion  method  comes  from  the  fact  that  the  instantaneous 
phase  drift  0n,m(fc)  changes  gradually  while  not  arbitrarily 
from  time  to  time  due  to  the  nature  of  ocean  waters. 
In  other  words,  the  rotating  phase  Zyn(k)  tends  to  be  a 
constant  over  a  small  group  of  Ns  consecutive  equalized 
symbols.  The  procedure  for  performing  the  groupwise  phase 
estimation  and  compensation  algorithm  is  presented  in  the 
following. 


Initialization.  For  the  equalized  data  block  of  size  Nb, 
designate  the  first  Nts  symbols  {x„(k)}^“1  as  the  training 
symbols  for  phase  reference  and  determine  the  initial  phase 
fn{ 0)  by 


fn  (g)  =  fn{g  ~  !)  +  A  f„  (g)  .  (22) 


Step  4.  Increment  g  by  1,  repeat  Steps  1-3  till  g  =  Ng. 


After  estimating  the  phases  for  the  Ng  groups,  we  can 
compensate  the  phase  rotation  in  the  equalized  symbols  on 
group  basis  as 


xn(g,k)  =  xn(g,k)e 


k  =  1,2,...,  Ns, 
g  =  1,2,...  ,Ng. 


(23) 


The  phase-compensated  symbol  xn(g,k)  is  then  ready  for 
detection.  It  is  pointed  out  that  if  the  last  group  has  less  than 
Ns  symbols,  then  the  calculation  through  (19)-(23)  needs 
to  be  carried  out  based  on  the  actual  number  of  symbols. 
Finally,  the  groupwise  phase  estimation  method  takes  the 
advantage  of  insensitivity  to  noise  perturbations,  due  to  the 
averaging  operation  in  (21). 


fn( 0)  =  X  i^X„{k)  -  4lxn(k)]. 
™tsk=  1 


(17) 


Then,  partition  the  remaining  Nb  -  Nts  symbols  into  Ng 
groups,  each  having  Ns  symbols,  except  that  the  Ng th 
group  may  have  less  than  Ns  symbols.  Further,  for  an 
M- ary  PSK  (MPSK)  modulation  constellation  Sm  = 
{exp[(j(m  -  l)27r)/M]}^=I,  define  a  phase  quantization 
function  Q[-]  as  follows: 


(2m  -  3)7T  __  k  __  (2m  -  1)71 
2 (m  -  1)71  **  ~  V  < 


M 


M  r  M 
m  =  1,2 ,...  ,M. 


(18) 


Set  g  =  1. 

Step  1.  Compensate  the  phase  of  the  gth  group  data  by 
e^iVnlg-P,  yielding 


Xn(g,k) 

=  xn(Nts+(g-l)Ns  +  k)e-^-1),  k=l,2,...,Ns. 

(19) 

Step  2.  For  each  symbol  in  the  gth  group,  calculate  its  phase 
deviation  from  the  corresponding  nominal  phase  as 

<Pn(g,k)  =  Zx„(g,k)  -  Q [Zx„(g,k)],  k  =  1,2 ,...,NS. 

(20) 


4.3.  Layered  Space-Time  Processing.  Fayered  space-time  pro¬ 
cessing  was  first  proposed  in  [24],  for  frequency-flat  fading 
channels.  It  was  extended  to  frequency-selective  fading 
channels  in  [25].  The  basic  idea  of  the  layered  time-space 
processing  is  to  detect  multiple  data  streams  one  by  one  in  a 
specific  order  [26],  so  that  the  overall  detection  performance 
of  all  streams  can  be  improved  compared  to  joint  detection. 

In  [25],  the  layered  time-space  processing  is  operated 
with  the  ordered  successive  interference  cancelation  (OSIC), 
where  the  transmitted  streams  are  detected  in  an  order 
that  strong  streams  are  detected  earlier  than  weak  streams, 
and  each  stream  is  processed  with  the  interference  from 
all  previously  detected  streams  already  canceled  out.  For 
a  MIMO  UWA  communication  system  using  space-time 
trellis  coding  (STTC)  [27],  a  layered  receiver  structure 
adopting  the  proposed  equalization  scheme  discussed  above 
is  demonstrated  in  Figure  1,  where  N  stages  each  detecting 
one  of  the  N  transmission  streams  are  included.  At  each 
stage,  channel  equalization  and  phase  compensation  are 
operated  separately. 

The  key  for  the  success  of  OSIC-based  space-time 
processing  lies  in  the  successive  interference  reconstruction 
and  cancelation.  For  a  detection  order  {01, 02,. . . ,  On}  with 
on  denoting  the  index  of  the  stream  detected  at  stage  m(1  < 
n  <  N),  the  constructed  interference  of  the  on  th  stream  after 
it  is  detected  is  given  as 


1-1 

Ion,m(.k')  ^  h0rt^m  ( k ,  l)x0n  ( k ) 

1=0 


(1  <  m  <  M), 


(24) 


where  x0rt{k)  =  x0n(k)e^Zxort^~Zxori('k^  with  x0n(k )  and 
x0„  (k)  being  the  equalized  symbol  and  the  detected  symbol 
of  the  o„th  stream,  respectively,  as  shown  in  Figure  1. 
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Figure  1:  Layered  MIMO  receiver  architecture  adopting  the  proposed  equalization  (in  each  layer,  an  IC  equalizer  and  a  phase  compensator 
perform  interference  cancelation  and  phase  compensation  separately). 


The  reconstructed  interference  of  the  o„th  stream  is  then 
canceled  out  as 

y%\k)  =  yfr-'Hk)  -  (1  <  m  <  M),  (25) 

where  ym\k )  denotes  the  received  signal  with  the  inter¬ 
ference  of  the  previous  n  streams  already  subtracted  out 
and  is  ready  to  be  used  for  detecting  the  o„+ith  stream. 
When  detecting  the  Oith  stream,  we  have  ym(k)  =  ym(k). 
The  procedure  in  (24)-(25)  is  repeated  until  all  streams  are 
detected. 

5.  Experimental  Results 

The  layered  receiver  structure  adopting  the  proposed  equal¬ 
ization  scheme  as  shown  in  Figure  1  has  been  tested  by 
two  undersea  experiments:  Makai05  and  Unet06.  We  present 
the  details  on  data  processing  for  both  experiments  in  this 
section. 

5.1.  Results  of  Makai05  Experiment.  Single-band  (SB)  and 
multiband  (MB)  MIMO  underwater  experiments  were  con¬ 
ducted  off  the  northwestern  coast  of  Kauai,  Hawaii,  in 
September  2005.  In  MB  experiment,  six  signal  bands  each 
having  a  symbol  rate  of  2  kilo  symbols  per  second  (ksps)  were 
used.  The  adoption  of  MB  transmission  aims  to  reduce  the 
equivalent  symbol-spaced  channel  length  (thus  reduce  the 
equalization  complexity),  while  still  achieve  a  high  data  rate. 
Its  spectral  efficiency,  however,  is  lower  than  SB  transmission 
due  to  the  insertion  of  guard  bands  among  multiple  signal 
bands.  Signal  detection  for  MB  transmission  has  been 
presented  in  [17],  and  no  results  for  SB  transmission  have 
been  previously  reported  yet. 

In  this  paper,  we  focus  on  SB  experiment,  which  further 
includes  low  band  (LB)  transmission  and  high  band  (HB) 
transmission.  In  LB  transmission,  the  carrier  frequency  was 


Figure  2:  Signalling  at  the  transmitter  in  Makai05  experiment. 


fc  =  32  kHz  and  the  symbol  interval  was  0. 1  ms.  The 
occupied  channel  bandwidth  was  fi,  =  14  kHz  due  to  the 
use  of  a  pulse  shaping  filter  with  roll-off  factor  0.4.  In 
HB  transmission,  the  carrier  frequency  was  fc  =  37.5  kHz 
and  the  symbol  interval  was  0.05  ms.  The  occupied  channel 
bandwidth  was  fi,  =  25  kHz  due  to  a  pulse  shaping  filter  with 
roll-off  factor  0.25.  The  modulations  included  BPSK,  QPSK, 
and  8PSK  for  both  LB  and  HB  transmissions.  The  transmitter 
was  a  ten-transducer  array  with  2  meters  separation  between 
adjacent  transducer  elements  and  was  deployed  over  the  side 
of  the  Kilo  Moana  research  vessel.  The  receiver  consisted 
of  eight  hydrophones  with  2  meters  separation  between 
adjacent  hydrophone  elements  and  was  allowed  to  drift 
freely.  The  transmission  range  was  2  kilometers  (km). 

In  Figure  2,  the  signalling  for  a  two-transducer  transmis¬ 
sion  is  shown.  From  the  figure,  the  binary  stream  a  is  input 
to  the  symbol  mapping  module  with  the  output  modulation 
symbol  stream  denoted  by  s.  The  symbol  stream  s  is  then 
encoded  by  space-time  trellis  coding,  and  the  encoded  stream 
is  demultiplexed  into  two  substreams  Si  and  s2  for  two 
transducers.  In  this  experiment,  the  four-state  BPSK,  four- 
state  QPSK,  and  eight-state  8PSK  STTC  codes  provided  in 
[27]  were  used,  all  for  two-transducer  transmission.  Symbol 
interleavers  (n)  were  optionally  used  to  transform  Si  and  s2 
into  xi  and  x2  (si  =  xi  and  s2  =  x2  when  symbol  interleaving 
was  not  applied).  For  the  transmission  with  more  than  two 
transducers,  the  grouped  STTC  was  used,  with  the  details 
referred  to  [17]. 
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The  symbol  stream  was  transmitted  in  packets.  In 
Figure  3,  the  MIMO  packet  structure  is  shown.  From  the 
figure,  the  packet  consists  of  three  parts:  the  first  part  con¬ 
tains  a  sequence  of  probe  signals  transmitted  by  transducer 
one  to  transducer  ten,  plus  a  0.2-second  clear  time.  Each 
probe  signal  has  a  duration  of  0.4  seconds  and  contains 
a  linear  frequency  modulation  (EFM)  signal  for  synchro¬ 
nization.  The  second  part  contains  data  payload  transmitted 
simultaneously  by  designated  transducers  depending  on  the 
experimental  configuration.  It  has  a  time  duration  of  4.8 
seconds.  At  the  receiver  side,  part  of  the  transmitted  symbols 
will  be  used  as  pilot  symbols  for  channel  estimation,  which 
will  become  evident  shortly.  The  last  part  is  a  3-second  clear 
time  used  for  transmission  system  resynchronization.  The 
whole  packet  has  a  time  duration  of  12  seconds. 

The  signal  detection  procedures  for  LB  transmission  and 
HB  transmission  are  similar,  and  we  present  the  details  for  LB 
transmission  without  loss  of  generality.  As  mentioned  above, 
the  LFM  signal  contained  in  the  probe  signal  is  used  for 
packet  synchronization  due  to  its  good  correlation  property. 
Moreover,  the  channel  length  L  can  also  be  estimated  with  the 
LFM  signals  by  measuring  the  span  of  the  significant  LFM 
correlation  [11].  In  Figure  4,  an  example  of  the  normalized 
LFM  correlation  is  depicted.  It  is  clearly  shown  that  there 
exists  a  correlation  peak  indicating  the  synchronization 
point.  The  significant  correlation  spans  a  range  about  10  ms, 
over  which  most  of  the  channel  energy  is  concentrated.  The 
channel  length  is  thus  estimated  as  L  =  100,  in  terms  of  LB 
symbol  interval  Ts  =  0. 1  ms. 

Once  the  packet  synchronization  was  achieved,  the 
MIMO  channel  was  initially  estimated  using  pilot  symbols 
located  at  the  front  of  the  data  payload  with  the  method 
described  in  Section  3.  For  the  channel  length  of  L  = 
100  requiring  a  minimum  training  length  of  299  with 
two-transducer  MIMO  transmission,  a  larger  pilot  length 
Np  =  600  was  selected  so  as  to  obtain  an  accurate  channel 
estimation  and  also  mitigate  the  effect  of  additive  noise. 
Such  choice  of  pilot  length  corresponds  to  a  time  duration 
of  60  ms,  which  is  less  than  the  channel  coherence  time 
about  200  ms  corresponding  to  a  maximum  Doppler  spread 
between  2  ~  3  Hz  in  this  experiment.  The  approximation  of 
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Figure  4:  An  example  of  normalized  LFM  correlation. 


phase  drift  in  (6)  is  also  appropriate  in  this  case,  since  the 
maximum  possible  differential  phase  drift  is  less  than  n/8. 
Figures  5  and  6  show  two  examples  of  the  estimated  channel 
impulse  responses  (CIRs)  for  the  two-transducer  eight- 
hydrophone  MIMO  transmissions  with  BPSK  and  QPSK 
modulations,  respectively.  Obviously,  all  subchannels  are 
sparse  with  two  distinct  peaks  over  the  depicted  delay  spread. 
The  estimated  MIMO  channel  with  8PSK  modulation  was 
similar  to  that  depicted  in  Figure  6  and  is  not  shown  for 
brevity.  The  estimated  channel  was  also  used  to  determine 
the  detection  order  in  the  layered  space-time  processing.  For 
each  stream  (transducer),  its  related  channel  energy  is  defined 
as  the  summation  of  all  related  subchannel  energies.  Then  a 
simple  ordering  criterion  is  to  compare  the  related  channel 
energies  among  streams,  and  the  stream  having  larger  related 
channel  energy  is  detected  earlier. 

Periodic  pilot  sequences  were  inserted  to  divide  the 
data  payload  into  frames,  and  each  frame  was  artificially 
partitioned  into  blocks  for  processing,  as  shown  in  Figure  7. 
We  chose  the  block  size  Nf,  =  200,  corresponding  to  20  ms. 
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CIR:  Tx  1  to  Rx  1 


CIR:  Tx  2  to  Rx  1 


(a) 


CIR:  Tx  1  to  Rx  4 


CIR:  Tx  1  to  Rx  8 


(b) 


CIR:  Tx  2  to  Rx  4 


(d) 

CIR:  Tx  2  to  Rx  8 


(e)  (f) 

Figure  5:  Estimated  channel  impulse  responses  for  two-transducer  eight-hydrophone  transmission  with  BPSK  modulation. 


The  initial  MIMO  channel  estimation  was  employed  to 
detect  the  block  following  the  first  pilot  sequence,  using 
the  layered  detection  scheme.  In  each  layer,  Nj,  symbols 
of  the  corresponding  stream  were  first  equalized.  The  LE 
parameters  Ki  and  K2  in  (8)  were  selected  as  K\  =  K2  = 
L  —  1.  An  interesting  observation  was  that  using  smaller 
K2  <  L  only  slightly  degraded  the  equalization  performance 
while  decreased  the  computational  complexity  considerably. 
The  Nb  equalized  symbols  were  then  fed  into  the  phase 
compensation  unit,  as  shown  in  Figure  1.  In  the  operation 
of  groupwise  phase  estimation  and  correction,  the  group 
size  Ns  =  20  was  used,  and  there  were  Ng  =  10  groups  in 
one  block.  After  phase  compensation,  the  Nb  symbols  were 
detected.  The  detected  symbols  together  with  the  equalized 
symbols  were  also  used  to  construct  the  interference  of  the 
detected  stream,  which  was  subtracted  out  of  the  received 
signal  for  detecting  the  next  stream.  The  block  processing 
was  finished  when  all  N  streams  were  detected.  To  effectively 
track  the  time  variation  of  MIMO  channel,  Np  previously 
detected  symbols  (part  of  them  could  be  pilot  symbols) 
of  all  streams  were  used  to  reestimate  the  channel,  as 
shown  in  Figure  7.  The  updated  channel  was  then  used  to 
detect  the  current  data  block  in  a  similar  way  mentioned 
above.  The  channel  reestimation  and  detection  procedure 
continued  until  the  next  pilot  sequence  started.  The  insertion 


of  periodic  pilot  sequences  aimed  to  combat  possible  error 
propagation  of  decision-directed  channel  reestimation,  and 
the  new  frame  was  processed  in  the  same  way  as  the  previous 
one.  The  frame  size  was  selected  to  achieve  a  tradeoff 
between  detection  performance  and  training  overhead.  In 
the  processing,  we  selected  Nf  =  16, 000  which  incurred  only 
5%  overall  training  overhead,  comparing  to  20%  training 
overhead  with  conventional  algorithms  [17,  28].  Once  an 
entire  packet  was  detected,  the  N  symbol  streams  were 
deinterleaved  (if  the  symbol  interleaver  was  used)  and  then 
sent  to  the  STTC  decoding  and  symbol  demapping  module 
to  produce  the  decoded  information  bit  stream,  as  shown  in 
Figure  1.  The  STTC  decoding  was  easily  implemented  with 
Viterbi  algorithm. 

In  Figure  8,  the  sequence  of  estimated  CIRs  obtained 
during  packet  detection  is  depicted,  where  the  temporal 
variations  of  the  four  subchannels  are  clearly  shown  over 
the  depicted  time  interval  within  one  packet.  In  Figure  9, 
the  scatter  plot  shows  two  channels  (hydrophone  one  and 
hydrophone  eight)  of  received  baseband  signal  and  the 
equalized  and  phase-corrected  BPSK  symbols  of  transducer 
one  for  a  2  X  8  MIMO  packet.  In  Figure  9(c),  the  phase  rota¬ 
tions  in  the  equalized  symbols  are  obvious  when  comparing 
with  the  phase-corrected  symbols  shown  in  Figure  9(d).  The 
phase-corrected  symbols  have  been  properly  classified  into 
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CIR:  Tx  1  to  Rx  1  CIR:  Tx  2  to  Rx  1 
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Figure  6:  Estimated  channel  impulse  responses  for  two-transducer  eight-hydrophone  transmission  with  QPSK  modulation. 
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Figure  7:  Partition  of  the  transmitted  data  payload  for  block  detection  at  the  receiver  (the  same  structure  applied  to  all  N  transmission 
streams). 


Table  1:  BER  of  LB  2  x  8  MIMO  with  BPSK  Modulation. 


Packet  index 

Uncoded  BER  of 

Tx  1 

Uncoded  BER  of 

Tx  2 

Uncoded  BER  of 

Tx  1  and  2 

BER  after  STTC 
decoding 

1 

2.800e-3 

4.178e-3 

3.489e-3 

2.000e-4 

2 

4.237e-4 

1.695e-4 

2.966e-4 

0 

3 

1.556e-3 

9.333e-4 

1.244e-3 

l.llle-4 

4 

1.316e-4 

5.482e-4 

3.399e-4 

0 

5 

2.412e-4 

1.096e-4 

1.695e-4 

6.579e-5 

6 

4.240e-5 

1.271e-4 

8.475e-5 

0 

7 

8.114e-4 

5.044e-4 

6.579e-4 

6.579e-5 

8 

1.059e-4 

6.360e-5 

8.475e-5 

0 

Mean 

7.645e-4 

8.294e-4 

7.954e-4 

5.533e-5 
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Figure  8:  Demonstration  of  channel  time  variation  within  one  packet  (BPSK  modulation). 


Table  2:  BER  of  LB  2  x  8  MIMO  with  QPSK  Modulation. 


Packet  index 

Uncoded  BER  of 

Txl 

Uncoded  BER  of 

Tx  2 

Uncoded  BER  of 

Tx  1  and  2 

BER  after  STTC 
decoding 

1 

1.656e-3 

1.756e-l 

8.863e-2 

6.984e-2 

2 

1.126e-4 

1.707e-2 

8.592e-3 

1.374e-3 

3 

1.351e-4 

1.498e-3 

8.164e-4 

2.252e-5 

Mean 

6.346e-4 

6.472e-2 

3.268e-2 

2.375e-2 

Table  3:  BER  of  LB  2x8  MIMO  with  8PSK  Modulation. 

Packet  index 

Uncoded  BER  of 

Txl 

Uncoded  BER  of 

Tx  2 

Uncoded  BER  of 

Tx  1  and  2 

BER  after  STTC 
decoding 

1 

8.213e-3 

1.614e-l 

8.48  le-2 

2.508e-2 

2 

1.669e-2 

1.199e-l 

6.831e-2 

1.265e-2 

Mean 

1.245e-2 

1.407e-l 

7.656e-2 

1.887e-2 
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Equalized  symbols  of  Tx  1  (BPSK) 


Real  part 


Oh 


Phase-corrected  symbols  ofTx  1  (BPSK) 


-2-10  1  2 
Real  part 
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Figure  9:  Received  baseband  signal,  equalized  and  phase-corrected  BPSK  symbols. 


two  clusters  centered  at  the  two  nominal  BPSK  modulation 
symbols  -1  and  +1.  Similar  observations  are  found  with 
the  equalized  and  phase-corrected  QPSK  and  8PSK  symbols, 
as  depicted  in  Figure  10.  All  demonstrations  have  therefore 
verified  the  necessity  and  effectiveness  of  the  phase  compen¬ 
sation  operation. 

From  the  available  experimental  data,  we  had  eight  2x8 
BPSK  packets,  three  2x8  QPSK  packets,  and  two  2x8  8PSK 
packets  for  LB  transmission.  We  processed  all  these  packets 
with  the  proposed  detection  scheme,  and  the  bit  error  rate 


(BER)  results  for  the  2x8  MIMO  transmission  with  BPSK, 
QPSK,  and  8PSK  modulations  are  listed  in  Table  1  through, 
Table  3.  It  is  noted  that  symbol  interleaving  was  applied  only 
to  even-indexed  packets  in  all  three  tables.  From  these  tables, 
we  make  three  observations.  First,  the  BER  performance 
degrades  when  the  size  of  the  constellation  size  increases, 
by  comparing  the  average  uncoded  BERs  listed  in  the  last 
lines  of  the  three  tables.  Second,  the  average  uncoded  BERs 
of  the  two  streams  are  comparable  with  BPSK  modulation, 
while  the  first  stream  has  better  average  uncoded  BER  than 
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Equalized  symbols  of  Tx  1  (QPSK) 
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Figure  10:  Equalized  and  phase-corrected  QPSK  and  8PSK  symbols. 


0 

Real  part 


Table  4:  BER  of  HB  2  x  8  MIMO  with  BPSK  Modulation. 


Packet  index 

Uncoded  BER  of 

Tx  1 

Uncoded  BER  of 

Tx  2 

Uncoded  BER  of 

Tx  1  and  2 

BER  after  STTC 
decoding 

1 

1.190e-4 

1.212e-3 

6.656e-4 

1.082e-4 

2 

9.740e-5 

7.359e-4 

4.167e-4 

0 

3 

1.840e-4 

1.190e-3 

6.872e-4 

7.576e-5 

4 

7.325e-3 

7.993e-3 

7.659e-3 

1.064e-3 

5 

6.494e-5 

1.027e-2 

5.168e-3 

4.361e-3 

6 

1.447e-2 

2.103e-2 

1.775e-2 

2.597e-4 

Mean 

3.710e-3 

7.072e-3 

5.391e-3 

9.781e-4 
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Figure  11:  SIMO  packet  structure  in  Unet06  experiment. 
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Figure  12:  Channel  scattering  functions. 


that  of  the  second  stream  with  QPSK  and  8PSK  modulations. 
This  phenomenon  is  well  explained  by  the  fact  that  the  two 
streams  have  similar  related  channel  energy  in  BPSK  case  as 
shown  in  Figure  5.  In  contrast,  stream  one  has  a  larger  related 
channel  energy  than  stream  two  in  QPSK  case,  as  shown 
in  Figure  6.  Third,  the  BER  performance  of  all  packets  is 
improved  with  STTC  decoding.  The  even-indexed  packets 
have  more  significant  improvement  after  STTC  decoding 
than  the  odd-indexed  packets,  due  to  the  use  of  symbol 
interleaving. 

We  finally  presented  the  results  for  HB  transmission.  The 
channel  length  was  measured  as  L  =  150  compared  to  L  = 
100  with  EB  transmission,  due  to  the  smaller  symbol  interval 
in  this  case.  The  BER  results  for  HB  transmission  are  shown 
in  Table  4  with  BPSK  modulation.  The  average  uncoded  BER 
is  on  the  order  of  5  X  10~3,  which  is  not  as  good  as  that  of 
Table  1  due  to  the  increased  transmission  rate. 


5.2.  Results  of  Unet06  Experiment.  SIMO  UWA  communi¬ 
cation  experiments  with  both  moving-source  transmission 


and  fixed-source  transmission  were  conducted  at  Saint 
Margaret’s  Bay,  Nova  Scotia,  Canada,  in  May  2006.  For 
moving-source  transmission,  the  transmitter  was  deployed 
21  meters  deep  in  the  water  and  towed  at  a  speed  up  to 
4  knots.  The  communication  distance  was  ranging  from 
1  km  to  3  km.  For  fixed-source  transmission,  the  transmitter 
was  suspended  in  water  at  21  meters  depth  and  44  meters 
above  the  sea  bottom,  and  the  transmission  range  was  fixed 
as  3.06  km.  The  receiver  consisted  of  eight  hydrophones 
arranged  unequally  on  a  1.86-meter  vertical  array,  which  was 
suspended  30  meters  deep  in  the  water.  QPSK  modulation 
was  used  with  a  symbol  rate  of  4  ksps.  The  carrier  frequency 
was  fc=  17  kHz. 

The  structure  of  the  transmission  packet  is  depicted  in 
Figure  11.  It  starts  with  an  LFM  signal  named  LFMB  and 
ends  with  another  LFM  signal  called  LFME.  An  m-sequence 
of  length  1023  is  embedded  in  the  packet,  and  the  data 
payload  carries  40397  symbols.  The  whole  packet  has  a  time 
duration  of  15  seconds. 

Without  loss  of  generality,  the  processing  details  on 
moving-source  packets  are  presented.  As  before,  packet 
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Figure  13:  Partition  of  the  transmitted  data  payload  for  block  detection  at  the  receiver. 


Table  5:  Doppler  shift  estimation  in  moving-source  transmission 
(Hz). 


1 

2 

3 

4 

5 

6 

7 

i 

15.98 

15.71 

12.44 

10.80 

10.79 

11.25 

8.29 

2 

15.98 

15.71 

12.44 

10.80 

10.79 

11.25 

8.38 

3 

15.98 

15.71 

12.44 

10.80 

10.79 

11.25 

8.02 

4 

15.98 

15.71 

12.44 

10.80 

10.79 

11.25 

8.02 

5 

15.98 

15.71 

12.44 

10.80 

10.79 

11.25 

8.46 

6 

16.16 

15.71 

12.44 

10.89 

10.79 

11.25 

8.36 

7 

15.98 

15.71 

12.44 

10.80 

10.79 

11.25 

8.26 

8 

15.98 

15.71 

12.44 

10.80 

10.79 

11.25 

8.29 

Table  6:  Average  uncoded  BER  for  Unet06  Experiment. 


Number  of 
combining 
channels 

Moving  Source 

Fixed  Source 

1 

3.027e-l 

3.135e-l 

2 

2.778e-l 

2.139e-l 

3 

2.083e-2 

1.064e-2 

4 

1.017e-2 

1.740e-3 

5 

3.455e-3 

1.490e-4 

6 

7.399e-4 

1.366e-4 

7 

7.222e-4 

6.621e-5 

8 

6.248e-4 

3.104e-5 

synchronization  was  achieved  and  the  channel  length  L 
was  estimated,  both  with  the  help  of  LFMB  signal.  The 
equivalent  symbol-spaced  channel  length  was  estimated  as 
L  =  20  in  moving-source  transmission.  The  LFMB  signal 
together  with  the  LFME  signal  determined  the  received 
packet  duration,  which  was  used  to  estimate  the  average 
Doppler  shift  [22].  In  Table  5,  Doppler  shift  estimations  are 
listed  for  seven  moving-source  packets.  For  each  packet, 
the  estimation  was  performed  independently  on  each  of  the 
eight  channels  (hydrophones).  Obviously,  the  Doppler  shift 
is  nonnegligible  compared  to  the  carrier  frequency  of  17  kHz. 
The  visual  demonstration  of  the  Doppler  effect  resorts  to  the 
channel  scattering  function,  which  was  estimated  with  the 
m-sequence  attributing  to  its  sensitivity  to  Doppler  spread. 
In  the  left  subfigure  of  Figure  12,  the  scattering  function  of 
the  first  moving-source  packet  in  Table  5  is  demonstrated. 


It  is  obvious  that  the  Doppler  spectrum  centers  around 
the  average  Doppler  shift.  For  comparison,  the  scattering 
function  for  a  fixed-source  packet  is  also  shown  in  the 
right  subfigure.  Since  there  is  no  relative  transceiver  motion, 
the  Doppler  spectrum  centers  around  zero  in  this  scenario. 
The  average  Doppler  shift  was  compensated  in  the  moving- 
source  packet  before  detection  started. 

For  the  channel  length  of  L  =  20  requiring  a  minimum 
training  length  of  39  with  SIMO  transmission,  a  larger 
training  length  Np  =  200  was  selected.  This  choice  of 
training  length  corresponds  to  a  time  duration  of  50  ms, 
which  is  less  than  the  channel  coherence  time  about  200  ms 
for  a  maximum  Doppler  spread  between  2  ~  3  Hz,  as 
shown  in  Figure  12  (in  moving- source  case,  the  Doppler 
shift  demonstrated  in  Figure  12(a)  will  be  compensated). 
The  maximum  differential  phase  drift  in  this  case  is  less 
than  7t/8  and  the  approximation  for  phase  drift  in  (6)  is 
appropriate.  Similar  to  MIMO  case,  the  received  packet  was 
partitioned  into  blocks  for  processing  with  a  block  size  of 
Nb  =  400,  which  is  0. 1  seconds  of  time  duration.  Only  one 
pilot  sequence  at  the  head  of  the  data  payload  were  used 
to  obtain  the  initial  MIMO  channel,  and  the  last  Np  = 
200  symbols  in  the  previously- detected  block  was  used  to 
reestimate  the  channel,  leading  to  the  partition  structure 
shown  in  Figure  13.  In  the  groupwise  phase  estimation  and 
compensation,  the  group  size  of  Ns  =  40  was  selected,  and 
there  were  Ng  =  10  groups  in  one  block. 

The  scatter  plots  in  Figures  14  and  15  demonstrate  the 
equalized  and  phase-corrected  QPSK  symbols  for  moving- 
source  packet.  From  Figure  14,  the  phase  rotations  in  the 
equalized  symbols  are  even  more  significant  compared  to  the 
MIMO  results  shown  in  Figures  9  and  10.  Regardless  of  the 
different  experimental  environments,  the  reason  lies  in  the 
adoption  of  larger  block  size  of  Nb  =  400,  compared  to  Nb  = 
200  used  in  MIMO  case,  during  detection.  The  selection  of 
larger  block  size  decreases  the  complexity  for  channel  adap¬ 
tion,  while  it  increases  the  accumulation  of  phase  rotations 
in  the  equalized  symbols.  From  Figure  15,  the  symbols  are 
well  classified  after  phase  correction,  which  indicates  that 
the  adopted  phase  estimation  and  compensation  algorithm 
is  effective  and  robust  under  different  channel  conditions. 

The  uncoded  BER  results  for  UNet06  experiment  are 
listed  in  Table  6.  In  total,  seven  moving-source  packets  and 
six  fixed-source  packets  were  processed.  The  processing 
procedure  for  fixed-source  packets  was  similar  to  that  of 
moving-source  packet,  except  that  the  channel  length  was 
L  =  60  in  this  case.  The  listed  BERs  are  averaged  ones  over 
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Equalized  signals  of  8  channels 


Real 

Figure  14:  Scatter  plot  of  equalized  QPSK  symbols  using  eight 
channels  in  Unet06  experiment. 


Phase-corrected  signals  of  8  channels 


Real 

Figure  15:  Scatter  plot  of  phase-corrected  QPSK  symbols  using 
eight  channels  in  Unet06  experiment. 


all  packets.  To  demonstrate  the  diversity  gain,  the  results 
corresponding  to  different  number  of  combining  channels 
are  all  shown.  Two  observations  are  made.  First,  the  fixed- 
source  transmission  has  better  uncoded  BER  performance 
than  the  moving-source  transmission,  due  to  the  absence  of 
Doppler  shift  in  the  received  signal.  Second,  it  is  obvious  that 
the  uncoded  BER  decreases  when  the  number  of  channels 
used  increases  due  to  the  increased  diversity  gain.  When  all 


eight  channels  are  used,  the  best  uncoded  BER  is  achieved  for 
both  transmissions. 

6.  Conclusion 

We  have  demonstrated  a  new  time-domain  MIMO  equal¬ 
ization  scheme  for  high  data  rate  single-carrier  underwater 
acoustic  communications,  where  the  separate  interference 
cancelation  and  phase  compensation  operations  have  been 
performed  during  detection.  MIMO  linear  equalization  was 
operated  in  an  ordered  successive  interference  cancelation 
fashion  to  achieve  enhanced  performance.  A  novel  groupwise 
phase  estimation  and  compensation  algorithm  was  used 
to  remove  the  phase  distortion  in  the  equalized  symbols. 
MIMO  channel  was  estimated  with  pilot  symbols  in  training 
mode  and  was  tracked  using  previously  detected  symbols 
in  decision-directed  mode,  incurring  no  more  than  5% 
training  overhead,  compared  to  20%  ~  30%  in  existing 
receiver  designs.  The  proposed  equalization  scheme  was 
tested  by  extensive  experimental  data  measured  off  the 
northwestern  coast  of  Kauai,  Hawaii,  in  September  2005, 
and  at  Saint  Margaret’s  Bay,  Nova  Scotia,  Canada,  in  May 
2006.  Processing  results  have  shown  the  effectiveness  and 
robustness  of  the  proposed  new  equalization  scheme  under 
different  UWA  transmission  conditions. 

Appendix 

A.  Calculation  of  MIMO  LE  Matrix 

The  MIMO  LE  matrix  given  in  (12)  has  implicit  forms 
which  are  difficult  for  evaluation.  Therefore,  it  is  desirable 
to  provide  explicit  solutions  directly  relating  to  the  channel 
knowledge.  To  achieve  that,  we  first  represent  y (k  -  q )  of 
(10a)  in  the  following  form: 


i—i 

y {k  ~  q)  =  ]Th(Z)x(/c  -  q-l)  +\{k-  q),  (A.l) 

l=o 

where  x(k  -  q  -  l)  =  [xi(k  —  q  —  l),xi(k  -  q-  l),.. .,xx(k  - 
q  -  l)]1,  \(k  —  q)  =  [vi  (k  -  q),v2(k  -  q), . . .  ,vM(k  -  q)}‘ ,  and 


H(Z) 


kid)  kid) 
kid)  kid) 


hN,id) 
kad ) 


(A.2) 


fiUMd)  kjad)  ■  •  ■  kjad) 


with  hn,md)  =  hnimd)e^"’m^  being  the  effective  channel 
coefficient  combining  the  fading  tap  with  phase  drift.  Herein, 
the  index  I  is  determined  as  I  =  (2k  +  Ki  —  K{)I2  similar 
to  that  in  (5),  and  the  approximation  leading  to  (6)  is  also 
used  to  validate  (A.l).  Then,  we  are  able  to  express  y  e 

(pM(Xi+jc2-n)xi  jn  (g)  as 


y  =  Hx  +  v,  (A.  3) 

where  H  e  (3m(K+JC2+i)xn(jCi+x2+i)  js  given  jn  the  following: 
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H  = 


H (L-  1)  H(L  -  2)  ■  ■  ■  H(0)  0  0 

0  H(L  -  1)  H(L  -  2)  ■  ■  ■  H(0)  0 

0  0  ■  ■  ■  0  H(L  -  1)  H(L  -  2) 


0 

0 

H(0) 


(A.4) 


and  x  =  [xf(fc  -  K2  -  L  +  1  ),xt(k  -  K2  -  L  + 
2 xf(/c  +  ECi ) ] f  e  (5N(Xi+x2+i,)xi  js  a  yg^oj-  containing 
transmission  symbols  of  all  N  transducers  during  the 
period  [k  -  K2  -  L  +  l,k  +  IG],  and  the  noise  vector  is 
defined  as  v  =  [\‘{k  -  K2), \*{k  -  K2  +  1), . . .  ,v\k  +  K\)]‘  e 
gM(Xi+x2+i)xi_  ]vj0W)  ^  substituting  y  in  (12)  with  (A.3),  we 
obtain 

Cmmse  =  e[x(/c)x,']h,'{he[xx;i]h?1  +  e[w'']}_1. 

(A.  5) 

In  (A.5),  we  still  need  to  evaluate  EIxx*],  Efxjfcjx*1]. 
Herein,  the  assumption  that  the  transmitted  symbols  are 
uncorrelated  both  spatially  and  temporally  is  used.  In  this 
case,  we  have  E [xi(k)xj(l)]  =  afS{i- j)S(k  - 1)  with  of  being 
the  power  of  transmission  symbol  xn(k),  and 

Rx  ^  E[xx'!]  =  02xlN(Kl+K2+L)  e  CNiK^DxNiK^L), 


(A.6a) 


Rh  =  -’^[x^x'1] 

=  [0nx(jc2+l-i)w  In  OnxXin]  e  cNxNfKl+Kl+L\ 

Defining  the  SNR  /3  as 


(A.6b) 


(A.  7) 


and  combining  (A.6a)-(A.6b),  we  finally  express  the  LE 
matrix  in  (A.5)  as  follows: 


Cmmse  =  R,!H,!  I  HHft  +  ^Im(X!+x2+i) 


1, 


(A.  8) 


=  R 1 1  H  !H  +  —Ijs^Ki+k2+l)  I  H'1. 
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